We report on an NMR study of unlabeled and uniformly 13C,15N-labeled
d(GAGCAGGT) sequence in 1 M NaCl solution, conditions under which
it forms a head-to-head dimeric quadruplex containing sequentially
stacked G·C·G·C, G·G·G·G and A·T·A·T tetrads. We have identified,
for the first time, a slipped A·T·A·T tetrad alignment, involving
recognition of Watson-Crick A·T pairs along the major groove edges of
opposing adenine residues. Strikingly, both Watson-Crick G·C and A·T
pairings within the direct G·C·G·C and slipped A·T·A·T tetrads,
respectively, occur between rather than within hairpin subunits of the
dimeric d(GAGCAGGT) quadruplex. The hairpin turns in the head-to-head
dimeric quadruplex involve single adenine residues and add to
our knowledge of chain reversal involving edgewise loops in DNA
quadruplexes. Our structural studies, together with those from other
laboratories, definitively establish that DNA quadruplex formation is not
restricted to Gn repeat sequences, with their characteristic stacked
uniform G·G·G·G tetrad architectures. Rather, the quadruplex fold is a
more versatile and robust architecture, accessible to a range of mixed
sequences, with the potential to facilitate G·C·G·C and A·T·A·T tetrad
formation through major and minor groove alignment, in addition to
G·G·G·G tetrad formation. The definitive experimental identification of
such major groove-aligned mixed A·T·A·T and G·C·G·C tetrads within a
quadruplex scaffold has important implications for the potential
alignment of duplex segments during homologous recombination.

Introduction

There is an increasing appreciation of the role
DNA quadruplexes (reviews 1-4) may play in biological
processes ranging from replication, transcription
and recombination (review 9) to telomere
function (review 4). The earliest research focused on
quadruplex formation involving stacked G·G·G·G
tetrads (review 6), with polymorphism introduced
by the directionality of adjacent strands around the
quadruplex (review 4).

Formation of mixed tetrads could dramatically
increase the versatility of quadruplex formation by
allowing adoption of this four-stranded architecture
by sequences other than simple Gn repeats.
Towards this end, efforts have been made to identify
and characterize G·C·G·C tetrads, where a
pair of Watson-Crick G·C pairs can potentially
align either through their major groove or their
minor groove edges. This approach was successful
in the case of the Fragile X syndrome triplet
repeat-containing d(GCGGT3GCGG) sequence,
which dimerizes in solution through head-to-tail
alignment of hairpins. The resulting quadruplex
contains G·C·G·C tetrads, through major groove
alignment of Watson-Crick G·C pairs (Figure 1(a)). 7

Figure 1. Potential G·C·G·C tetrad formation involving either (a)
direct or (b) slipped alignment of major groove edges of a pair of
Watson-Crick G·C pairs. The slipped alignment requires a monovalent
cation to coordinate the inwardly oriented acceptor atoms.
Potential A·T·A·T tetrad formation involving either (c) direct or (d)
slipped alignment of major groove edges of a pair of Watson-Crick
A·T pairs.



Independent structural studies have identified
minor groove-aligned G·C·G·C tetrads (Supplementary
Material, Figure S1(a)) in the crystalline
and solution states.

More recent studies have demonstrated that
G·C·G·C tetrads aligned through their major
groove edges can switch between two distinct
alignment geometries. In the direct alignment
(Figure 1(a)), the two G·C pairs align directly
opposite each other, resulting in the acceptor
atoms (O6 and N7) of G pairing with the donor
group (NH2-4) of C. Alternately, in the slipped
alignment (Figure 1(b)), the major groove edges of
two guanine bases are positioned opposite each
other. In this case, a monovalent cation (not necessarily
in the plane of the tetrad) is needed to coordinate
the four inwardly pointing nitrogen and
oxygen acceptor atoms on the guanine bases
(Figure 1(b)). Solution structural studies of the
adeno-associated viral repeat-containing
d(GGGCT4GGGC) sequence have demonstrated
that the direct G·C·G·C tetrad alignment
(Figure 1(a)) is formed in Na cation solution, 10
while the slipped G·C·G·C tetrad alignment
(Figure 1(b)) is formed in K cation solution. 11 Such
conformational transitions were unanticipated, and
attest to the diversity of pairing geometries in
mixed tetrad alignments and the role of cations in
modulating this transition.

The major groove-aligned G·C·G·C tetrad has
now been observed in a range of DNA
quadruplexes 7,10,12 and appears to be a robust tetrad
motif adopted by a range of DNA sequences. One
can therefore anticipate potential formation of the
corresponding major groove-aligned A·T·A·T
counterpart, either of the direct (Figure 1(c)) or
slipped (Figure 1(d)) alignment type. By contrast,
the corresponding minor groove-aligned A·T·A·T
tetrad, of the slipped cation-coordination type
(Supplementary Material, Figure S1(b)), has been
reported earlier both in the crystalline 1 * and
solution 9 states.

Demonstration of the major groove-aligned
A·T·A·T tetrad formation has turned out to be a
considerable challenge, and after many unsuccessful
attempts, our group has now identified a
sequence, d(GAGCAGGT), which forms a head-to-head
dimeric quadruplex (Figure 2(a) and Supplementary
Material, Figure S2), stabilized by
direct G·C·G·C (Figure 2(b)) and slipped
A·T·A·T (Figure 2(d)) tetrads, flanking a central
G·G·G·G tetrad (Figure 2(c)). Strikingly, we
observe Watson-Crick G·C pairing between monomer
subunits within the G·C·G·C tetrad and Watson-Crick
A·T pairing between monomer subunits
within the A·T·A·T tetrad in this quadruplex.
Finally, the edgewise turns in the head-to-head
dimeric quadruplex involve a single adenine residue
and provide new insights into chain reversal
in DNA quadruplexes.

Results 

Our group scanned a range of sequences containing
G, A and T residues in our attempts to generate
a major groove-aligned A·T·A·T tetrad,
stabilized through stacking on a G·G·G·G tetrad
within a quadruplex scaffold. This approach has
worked in the past in our laboratory when we
have generated stable triads and
A·(G·G·G·G)·A hexads, 12-19 stabilized through
stacking on G·G·G·G tetrads within a quadruplex
scaffold. We report on NMR studies of the d(GAGCAGGT)
sequence, which forms a dimeric quadruplex
[...] A·T·A·T (Figure 2(d)) tetrad.

Figure 2. (a) Folding topology of a head-to-head dimeric d(GAGCAGGT)
quadruplex in 1 M NaCl. The backbone tracings of the individual
strands are shown by thick lines and the chain directionality
indicated by arrows. Pairing alignments for (b) the G6·C4·G6·C4
tetrad, (c) the G3·G7·G3·G7 tetrad and (d) the A2·T8·A2·T8 tetrad.



Imino proton spectra and assignments

The exchangeable proton spectrum (5.5 to 14.5
ppm) of the d(GAGCAGGT) sequence was highly
dependent on added NaCl concentration. At low
NaCl concentration, the predominant conformation
gave very broad resonances in slow exchange with
a minor component that gave narrow resonances
(Supplementary Material, Figure S3). The equilibrium
shifted with increasing NaCl to the conformation
that exhibited narrow resonances, resulting
in the spectrum shown in Figure 3(a) in 1 M NaCl,
2 mM phosphate buffer (pH 6.6) at 10 °C. This
spectrum exhibits well resolved exchangeable (8.0
to 15.0 ppm) and non-exchangeable (7.0 to 8.7
ppm) resonances with the total number of imino,
amino and base proton resonances consistent with
formation of a single conformer in solution. We
observe two narrow imino protons between 11 and
12 ppm, a region characteristic of N-H···O hydrogen
bonds, and narrow resonances at 13.11 ppm
and 14.28 ppm, a region characteristic of N-H···N
hydrogen bonds 4 (Figure 3(a)). We also observe
two narrow amino proton resonances between 8.0
and 8.5 ppm and two additional narrow amino
proton resonances downfield-shifted between 9.0
and 9.2 ppm (Figure 3(a)).
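
The two imino regions quoted above can be turned into a small lookup, which is sometimes handy when sorting a peak list before detailed assignment. The following is only a minimal sketch: the function name is hypothetical, the 11 to 12 ppm window is taken from the text, and the 13.0 to 14.5 ppm window is an assumption chosen to bracket the two resonances reported at 13.11 and 14.28 ppm. A shift falling in a window is suggestive, not proof, of the hydrogen-bond acceptor type.

```python
def classify_imino_shift(ppm):
    """Label an imino proton shift by the acceptor type its region suggests.

    Windows: 11-12 ppm (from the text) for N-H...O, and an assumed
    13.0-14.5 ppm window for N-H...N. Anything else is left unassigned.
    """
    if 11.0 <= ppm <= 12.0:
        return "N-H...O"
    if 13.0 <= ppm <= 14.5:
        return "N-H...N"
    return "unassigned"


if __name__ == "__main__":
    # The two downfield resonances reported in the text, plus one
    # illustrative (hypothetical) value inside the 11-12 ppm region.
    for shift in (13.11, 14.28, 11.5):
        print(shift, classify_imino_shift(shift))
```

Such a region-based label is consistent with the later assignment of the two downfield resonances to Watson-Crick paired imino protons, but the assignments themselves rest on the NOE and scalar coupling data.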

The exchangeable imino and amino protons
have been assigned following analysis of two-dimensional
data sets on unlabeled and uniformly
13C,15N-labeled d(GAGCAGGT) sequence. An
expanded 60 ms mixing time NOESY contour plot
of d(GAGCAGGT) in 1 M NaCl at 0 °C is plotted
in Figure 3(b). The NOE cross-peaks are labeled in
the Figure and the assignments are given in the
legend. The corresponding NOESY data set
recorded at a longer mixing time of 200 ms is
plotted in Supplementary Material, Figure S4, and
the cross-peak assignments listed in the legend.
We could readily distinguish thymine from guanine
imino protons because of their distinct nitrogen
chemical shifts in a 1H-15N HSQC spectrum
(Supplementary Material, Figure S5).

We establish formation of an A2·T8 Watson-Crick
base-pair based on NOEs between the imino
proton of T8 and the amino (peaks a and a',
Figure 3(b)) and H2 (peak b, Figure 3(b)) protons
of A2. In addition, we unexpectedly observe NOEs
between the H8 and NH2 protons of A2 (peaks n
and n', Figure 3(b)), consistent with formation of a
major groove-aligned slipped A2·T8·A2·T8 tetrad,
schematically outlined in Figure 2(d).

We establish formation of a G6·C4 Watson-Crick
base-pair based on NOEs between the imino
proton of G6 and the amino protons of C4 (peaks c
and c', Figure 3(b)). We also detect NOEs between
the amino protons of C4 and the H8 proton of G6
(peaks l and l', Figure 3(b)), and between the H5
proton of C4 and the H8 proton of G6 (peak p,
Figure 3(b)), consistent with formation of a major
groove-aligned direct G6·C4·G6·C4 tetrad, schematically
outlined in Figure 2(b).

Figure 3. (a) Proton NMR spectrum (7.0 to 13.0 ppm) of the d(GAGCAGGT) quadruplex (8.8 mM in strands) in
1 M NaCl, 2 mM phosphate in H2O (pH 6.6) at 10 °C. (b) Expanded NOESY (60 ms mixing time) contour plot
correlating NOEs between imino, amino and non-exchangeable protons in the d(GAGCAGGT) quadruplex in 1 M NaCl,
2 mM phosphate, H2O (pH 6.6) at 0 °C. The NOE cross-peaks a to p are assigned as follows: a and a',
T8(NH3)-A2(NH2-6); b, T8(NH3)-A2(H2); c and c', G6(NH1)-C4(NH2-4); d and d', G3(NH1)-G3(NH2-2); e,
G3(NH1)-G7(H8); f and f', G7(NH1)-G7(NH2-2); g (observable in 200 ms mixing time NOESY experiment),
G7(NH1)-G3(H8); h, G7(NH2-2)-G7(NH2-2); i, G3(NH2-2)-G3(NH2-2); j and j', G3(NH2-2)-G7(H8); k,
C4(NH2-4)-C4(NH2-4); l and l', C4(NH2-4)-G6(H8); m and m', C4(NH2-4)-C4(H5); n and n', A2(NH2-6)-A2(H8);
o, A2(NH2-6)-A2(NH2-6); p, G6(H8)-C4(H5).

The narrow G3 and G7 imino protons exhibit a
set of NOEs indicative of G3·G7·G3·G7 tetrad formation,
schematically outlined in Figure 2(c).
These include NOEs between the H8 proton of G7
and the imino (peak e, Figure 3(b)) and amino
(peaks j and j', Figure 3(b)) protons of G3, and an
NOE between the H8 proton of G3 and the imino
proton of G7 (box g, Figure 3(b); observable in
200 ms NOESY spectrum, peak g, Supplementary
Material, Figure S4).

The NOE data outlined above for the d(GAGCAGGT)
sequence are consistent with formation of
slipped A2·T8·A2·T8 (Figure 2(d)), direct
G6·C4·G6·C4 (Figure 2(b)) and G3·G7·G3·G7
(Figure 2(c)) tetrads. This can be most readily
achieved by head-to-head dimerization to form a
quadruplex as shown schematically in Figure 2(a)
and Supplementary Material, Figure S2. The
exchangeable imino and amino proton chemical
shifts of the d(GAGCAGGT) quadruplex are listed
in Supplementary Material, Table S1.

Non-exchangeable proton spectra
and assignments

The non-exchangeable base and sugar protons
have been assigned following analysis of two-dimensional
data sets on unlabeled and uniformly
13C,15N-labeled d(GAGCAGGT) sequence. An
expanded 250 ms mixing time NOESY contour
plot of d(GAGCAGGT) quadruplex in 1 M NaCl,
2H2O buffer at 10 °C is plotted in Figure 4(a), along
with a tracing of distance connectivities between
base protons and their own and 5'-flanking sugar
H1' protons from G1 to T8 in the sequence. The G3
residue adopts a syn alignment based on the strong
H8 to its own H1' NOE in a 50 ms mixing time NOESY
stacked plot (Figure 4(b)). The remaining sugar
proton chemical shifts were obtained from an analysis
of other regions of the through-space NOESY
contour plot and through-bond COSY and TOCSY
correlations of the assigned sugar H1' protons with
the remaining sugar protons within individual
rings. The non-exchangeable base and sugar proton
chemical shifts of the d(GAGCAGGT) quadruplex
are listed in Supplementary Material, Table S2.
The H8 proton of A2 (8.71 ppm) is downfield-shifted
while the H6 (6.78 ppm) and H5 (5.15 ppm)
protons of C4 are somewhat upfield-shifted.

We can also correlate exchangeable imino protons
with their own H8 protons within individual
guanine bases, via through-bond correlations to
their C5 ring carbon atoms. 20 Such a correlation
experiment on the sample of uniformly 13C,15N-labeled
d(GAGCAGGT) sequence is shown in
Figure 4(c).

Hydrogen bond alignments 

We have verified the formation of NOE-based
G6·C4·G6·C4 (Figure 2(b)), G3·G7·G3·G7
(Figure 2(c)) and A2·T8·A2·T8 (Figure 2(d)) tetrad
alignments, by identifying through-bond coupling
connectivities across N-H···N hydrogen bonds 21 "**
within the folded architecture of the uniformly
13C,15N-labeled d(GAGCAGGT) quadruplex in 1 M
NaCl, H2O buffer at 0 °C. The amino protons of
A2, G3 and C4 could be correlated with their
directly attached nitrogen atoms using this labeled
sample as shown in Supplementary Material,
Figure S6.

We observe a N-H···N hydrogen bond connectivity
between the N1H donor of G6 and the
N3 acceptor of C4 (peak 4, Figure 5(a)) in a
HNN-COSY experiment 24 recorded on the d(GAGCAGGT)
quadruplex. This connectivity provides
direct support for a Watson-Crick G6·C4 alignment
(observed couplings as shaded region in
Figure 2(b)). In addition, we observe HNN-COSY
coupling connectivities between the NH2 protons
of C4 and the N7 of G6 (peaks 1 and 1', Figure 5(b))
and four-bond H(CN)N(H) coupling
connectivities 14 between the N4 of C4 and the H8 of
G6 (peak 3, Figure 5(c)). These observations verify
the direct alignment of opposing Watson-Crick
G6·C4 pairs through their major groove edges
(observed coupling as shaded region in
Figure 2(b)), to form the G6·C4·G6·C4 tetrad outlined
in Figure 2(b).

The observed H(CN)N(H) coupling
connectivities 24 between the H8 proton of G7 and the
N2 of G3 (peak 1, Figure 5(c)) and between the H8
proton of G3 and the N2 of G7 (peak 2, Figure 5(c)),
verify the proposed hydrogen bonding patterns
(shaded regions in Figure 2(c)) around the
G3·G7·G3·G7 tetrad.

We observe a N-H···N hydrogen bond connectivity
between the N3H donor of T8 and the N1
acceptor of A2 (peak 2, Figure 5(a)) in a HNN-COSY
experiment. This connectivity provides
direct support for a Watson-Crick A2·T8 alignment.
In addition, we observe HNN-COSY coupling
connectivities between the NH2 protons of A2
and the N7 of A2 (peaks 2 and 2', Figure 5(b)).
This connectivity could be either within an adenine
or between adenine bases positioned opposite each
other and hydrogen-bonded through their major
groove edges, as shown for the A2·T8·A2·T8 tetrad
alignment in Figure 2(d). We do not observe
the corresponding coupling connectivities between
the NH2 and N7 of A5, and hence we favor the latter
explanation over the former. These observations
verify the slipped alignment of opposing Watson-Crick
A2·T8 pairs through their major groove
edges, to form the A2·T8·A2·T8 tetrad outlined in
Figure 2(d).

Intra-strand versus inter-strand NOE restraints

The 2-fold symmetry in the dimeric d(GAGCAGGT)
quadruplex fold makes it critical to
unambiguously differentiate between intra-strand
and inter-strand contributions for key NOEs that
define the folding topology in solution. We prepared
a sample containing an equimolar mixture
of unlabeled and uniformly 13C,15N-labeled
d(GAGCAGGT) sequences and recorded 15N-edited
(ω1), 13C,15N-purged (ω2) NOESY 26 (100 ms
mixing time) spectra in 1 M NaCl-containing H2O
buffer solution at 0 °C, to identify inter-strand
NOEs and differentiate them from their intra-strand
counterparts, for the 50% component in the
mixture where the quadruplex contains one
unlabeled and one uniformly labeled strand. We
observe inter-strand NOEs between the imino proton
of T8 and the amino (peaks 1 and 1',
Figure 6(a); peaks 1 and 1', Figure 6(b)) and H2
(peak 2, Figure 6(b)) protons of A2, establishing
formation of Watson-Crick A2·T8 pairs between
hairpin subunits in the dimeric d(GAGCAGGT)
quadruplex. We also observe inter-strand NOEs
between the imino proton of G6 and the amino
protons of C4 (peaks 2 and 2', Figure 6(a); peaks 3
and 3', Figure 6(b)), establishing formation of Watson-Crick
G6·C4 pairs between hairpin subunits in
the dimeric d(GAGCAGGT) quadruplex.

Figure 4. (a) Expanded NOESY (250 ms mixing time) contour plot correlating NOEs between base and sugar H1'
protons in the d(GAGCAGGT) quadruplex ([...] mM in strands) in 1 M NaCl, 2 mM phosphate, 2H2O (pH 6.6) at
[...]. The NOE cross-peaks a and b are assigned as follows: [...] (c) [...] the guanine imino and H8 protons
by through-bond connectivities to the C5 carbon within individual guanine rings in the uniformly
13C,15N-labeled d(GAGCAGGT) quadruplex (5.3 mM in strands) in 1 M NaCl, 2 mM phosphate, H2O (pH 6.6) at 0 °C.

Figure 5. Identification of N-H···N hydrogen bond alignments
in uniformly 13C,15N-labeled d(GAGCAGGT) (5.3 mM in
strands) in 1 M NaCl, 2 mM phosphate, H2O (pH 6.6) at 0 °C.
Expanded 2JNN HNN-COSY contour plots correlating two-bond
coupling connectivities between donor and acceptor nitrogen atoms
within N-H···N hydrogen-bond alignments across base-pairs. (a)
Two-bond coupling connectivities between T8(N3) imino donor and
A2(N1) acceptor nitrogen atoms (peak 2) across the A2(N1)·T8(N3
donor) Watson-Crick base-pair and between G6(N1) imino donor and
C4(N3) acceptor nitrogen atoms (peak 4) across the C4(N3)·G6(N1
donor) Watson-Crick base-pair. (b) Two-bond coupling connectivities
between C4(N4) amino donor and G6(N7) acceptor nitrogen atoms
(peaks 1,1') across C4(N4 amino)·G6(N7) mismatch pair and
between A2(N6) amino donor and A2(N7) acceptor nitrogen atoms
(peaks 2,2') across A2(N6 amino)·A2(N7) mismatch pair. (c)
H(CN)N(H) spectrum showing inter-nucleotide H8(ω2)-N2(ω1) and
H8(ω2)-N4(ω1) cross-peaks. The spectrum consisted of 608 (t2) × 60
(t1) complex points, with 160 transients per FID. Spectral widths of
2000 Hz (t1max: 30 ms) and 8000 Hz (t2max: 76 ms) were used,
with a relaxation delay of two seconds, resulting in a total
acquisition of 12 hours. Coupling connectivities are observed between
the H8 of G7 and the N2 of G3 (peak 1), between the H8 of G3
and the N2 of G7 (peak 2), as well as between the H8 of G6 and N4 of
C4 (peak 3). A coupling connectivity was not detectable between
the H8 of A2 and the N6 of A2.

We also observe inter-strand NOEs between the
amino protons of A2 and the H8 proton of A2
(peaks 4 and 4', Figure 6(a)), putting restraints on
the relative alignments of the A·T base-pairs
around the A·T·A·T tetrad. Inter-strand NOEs are
also observed between the imino proton of G7 and
the H2 proton of A2 (peak 4, Figure 6(b)), putting
restraints on the relative stacking of adjacent
G3·G7·G3·G7 and A2·T8·A2·T8 tetrads in the
quadruplex.
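
The 50% figure quoted for the mixed-label dimer follows from simple binomial statistics, sketched below. This is an illustration only: the function name is hypothetical, and the calculation assumes that labeled and unlabeled strands pair at random, with no isotope effect on dimerization.

```python
from fractions import Fraction


def dimer_populations(labeled_fraction):
    """Populations of the three dimer isotopomers for random strand pairing.

    labeled_fraction is the mole fraction of labeled strands in the mixture.
    Random pairing (an assumption) gives binomial populations.
    """
    p = Fraction(labeled_fraction)
    q = 1 - p
    return {
        "unlabeled/unlabeled": q * q,
        "mixed": 2 * p * q,
        "labeled/labeled": p * p,
    }


if __name__ == "__main__":
    # Equimolar mixture, as used for the isotope-filtered NOESY spectra.
    for species, population in dimer_populations(Fraction(1, 2)).items():
        print(species, population)
```

Only the mixed species can give rise to cross-peaks that pass both the 15N-edited and the 13C,15N-purged filters, which is why these spectra report selectively on inter-strand contacts.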

Distance restraints and molecular 
dynamics calculations 

Distance restraints associated with exchangeable
protons (total of 115) were qualitatively deduced
from NOESY experiments in H2O at two mixing
times, while those associated with their non-exchangeable
proton counterparts (total of 178)
were quantified from NOE buildup curves in 2H2O
at four mixing times, as outlined in Materials and
Methods. The observation of a single set of narrow
resonances for the d(GAGCAGGT) sequence at
temperatures down to 6 °C was consistent with
formation of a dimeric quadruplex containing two
strands related by a 2-fold symmetry axis. Therefore,
non-crystallographic symmetry restraints
were used during the computations. All distance
restraints were classified as ambiguous during the
distance-restrained molecular dynamics computations.
Experimentally defined hydrogen bonding
alignments from NOE patterns on unlabeled
sample, N-H···N scalar couplings on isotopically
labeled sample, and intermolecular NOEs on an
equimolar mixture of labeled and unlabeled
sample, were used to restrain the G6·C4·G6·C4
(Figure 2(b)), G3·G7·G3·G7 (Figure 2(c)) and
A2·T8·A2·T8 (Figure 2(d)) tetrads, with the folding
models retaining these hydrogen bonding
alignments during the computations.

The solution structure of the dimeric d(GAGCAGGT)
quadruplex in 1 M NaCl was solved by
molecular dynamics computations guided by
hydrogen bonding and NOE distance restraints.
Sixty starting structures were generated for the
d(GAGCAGGT) 8-mer segment as sets of pairs of
randomized chains separated by space intervals of
50 Å. The protocol outlined in Materials and
Methods involved initial torsion space dynamics at
20,000 K followed by Cartesian space dynamics at
300 K. A subset of ten distance-refined structures
of the d(GAGCAGGT) quadruplex was identified
based on a combination of low NOE energies and
fewest NOE violations.

Intensity restraints and NOE back calculations 

The subset of ten converged distance-refined
structures was next refined against the non-exchangeable
proton NOE intensities associated
with NOESY spectra recorded at four mixing
times. These computations utilized a molecular
dynamics with back calculation protocol outlined
in Materials and Methods. The NOE violations,
deviations from covalent geometry and pairwise
r.m.s.d. values for the ten lowest energy intensity-refined
structures of the d(GAGCAGGT) quadruplex
(less the poorly defined G1 residues) are listed
in Table 1.

Structural features 

A stereo view of the ten superpositioned lowest
energy intensity-refined structures of the d(GAGCAGGT)
quadruplex (less the poorly defined G1
residues) is shown in Figure 7(a). The sugar-phosphate
backbones of individual symmetry-related
hairpins are colored in orange and green, with the
sequentially stacked loop A5 residue,
G6·C4·G6·C4 tetrad, G3·G7·G3·G7 tetrad and
A2·T8·A2·T8 tetrad colored in white, magenta,
yellow and cyan, respectively. Ribbon and surface
views of one representative refined structure of the
d(GAGCAGGT) quadruplex using the GRASP program
are plotted in Figure 7(b) and (c), respectively.

A stick representation of one symmetric strand
of the d(GAGCAGGT) quadruplex (less the poorly
defined G1 residues) is shown in Figure 8(a). The
pairing alignments of the G6·C4·G6·C4 tetrad and
A2·T8·A2·T8 tetrad in this representative structure
are shown in Figure 8(b) and (c), respectively.

The stacking geometry between the A5 bases
(in white) and the G6·C4·G6·C4 tetrad (in
magenta) is shown in Figure 9(a), while that
between the G6·C4·G6·C4 tetrad (in magenta)
and the G3·G7·G3·G7 tetrad (in yellow) is shown
in Figure 9(b). The overlap geometry between the
G3·G7·G3·G7 tetrad (in yellow) and the
A2·T8·A2·T8 tetrad (in cyan) is shown in
Figure 9(c).

Discussion 

The NMR-based structural characterization of
the d(GAGCAGGT) quadruplex was considerably
aided by the narrow and well-resolved resonances
of the one and two-dimensional proton NMR spectra,
and the availability of uniformly 13C,15N-labeled
sequence, which aided greatly in the resonance
assignment and identification of hydrogen
bonding alignments. The computations were also
aided by our ability to distinguish between inter-strand
and intra-strand NOEs (and in turn, Watson-Crick
hydrogen bonding alignments) using
equimolar mixtures of labeled and unlabeled
samples of the quadruplex.

Table 1. NMR restraints and structural statistics for the intensity refined structures of the dimeric d(GAGCAGGT)
quadruplex

A. NMR restraints
  Distance restraints (a)                               Non-exchangeable   Exchangeable
    Intra-residue distance restraints                          86               19
    Sequential (i, i + 1) distance restraints                  76               43
    Long range (≥ i + 2) distance restraints                   16               51
  Other restraints
    Hydrogen bonding restraints (b)                            48
    Torsion angle restraints (c)                               14
  Intensity restraints
    Non-exchangeable protons (each of 4 mixing times)         178
B. Structural statistics of complex following intensity refinement
  NOE violations
    Number > 0.2 Å                                        6.1 ± 0.7
    Maximum violations                                    0.27 ± 0.[?]
    r.m.s.d. of violations                                0.056 ± 0.02
  NMR R factor (R1/6)                                     0.074 ± 0.003
  Deviations from ideal covalent geometry
    Bond lengths                                          0.011 ± 0.001
    Bond angles                                           3.70 ± 0.09
    Impropers                                             0.40 ± 0.03
  Pairwise all heavy atom r.m.s.d. values (10 refined structures)
    All heavy atoms excluding G1

(a) All distance restraints were set as ambiguous between intra and inter-residue contributions.
(b) These hydrogen bonding restraints are based on experimental NOE and JNN coupling data.
(c) Residues A2, C4, A5, G6, G7 and T8 were restrained to χ values in the 210(±40)° range, characteristic of anti
glycosidic torsion angles, while residues G3 were restrained to χ values in the 65(±40)° range, characteristic of
syn glycosidic torsion angles, identified experimentally.

Quadruplex architecture

The experimental data on the d(GAGCAGGT)
sequence are consistent with formation of a dimeric
quadruplex in 1 M NaCl solution. Individual
strands of d(GAGCAGGT) form hairpins, with a
single adenine, A5, involved in chain reversal.
Dimerization involves head-to-head orientation of
hairpins (Figures 2(a) and 7; Supplementary
Material, Figure S1), with individual strands running
antiparallel relative to their neighbors around
the quadruplex (Figure 2(a)).

G·G·G·G tetrad

We detect a G3(syn)·G7(anti)·G3(syn)·G7(anti)
alignment around the G·G·G·G tetrad in the
dimeric d(GAGCAGGT) quadruplex. Such an
alignment about the G·G·G·G tetrad is consistent
with antiparallel alignment of adjacent strands
around the quadruplex, and has been reported for
the solution structures of the thrombin-binding
DNA aptamer quadruplex, 27,28 as well as the
d(GCGGT3GCGG) Fragile X syndrome-containing
sequence quadruplex 7 and the d(GGGCT4GGGC)
adeno-associated virus-containing sequence
quadruplex. 10

G·C·G·C tetrad

The G6·C4·G6·C4 tetrad (Figures 2(b) and 8(b))
is defined by all anti glycosidic torsion angles,
consistent with antiparallel arrangement of adjacent
strands around the dimeric d(GAGCAGGT)
quadruplex. This type of direct G·C·G·C tetrad
alignment (Figure 1(a)) has been observed previously
for the solution structures of the
d(GCGGT3GCGG) Fragile X syndrome sequence-containing
quadruplex 7 and the d(GGGCT4GGGC)
adeno-associated virus sequence-containing
quadruplex. 10 There is, however, a critical distinction in
that Watson-Crick G6·C4 pairs are formed
between rather than within hairpin subunits
(Figure 8(b)) in the head-to-head dimeric d(GAGCAGGT)
quadruplex reported here. This contrasts
with Watson-Crick G·C pairs within hairpin
subunits in the head-to-tail dimeric
d(GCGGT3GCGG) 7 and d(GGGCT4GGGC) 10
quadruplexes, reported previously.

The amino protons of C4 in the d(GAGCAGGT)
quadruplex resonate at 8.49 ppm and 9.23 ppm.
The chemical shift of 8.49 ppm is characteristic of a
cytosine amino proton hydrogen-bonded to a carbonyl
oxygen atom, such as would occur with a
guanine residue in a Watson-Crick G·C pair. The
chemical shift of 9.23 ppm is consistent with the
other cytosine amino proton also being hydrogen-bonded,
and its much larger downfield shift may
be reflective of its acceptors being both a ring nitrogen
and carbonyl oxygen, such as would occur in
a direct G·C·G·C tetrad (Figure 1(a)).

A·T·A·T tetrad

The slipped A2·T8·A2·T8 tetrad (Figures 2(d)
and 8(c)) contains all anti glycosidic torsion angles,
with hydrogen bonding along the major groove
edges of opposing adenine bases. This is the first
report of any type (direct or slipped) of an
A·T·A·T tetrad alignment, which involves pairing
along the major groove edges. This A·T·A·T pairing
alignment brings the H8 proton of one adenine
in close proximity to the NH2 proton of its adenine
partner (Figure 1(d)), consistent with the observed
intermolecular NOE between this pair of protons
in mixed labeling experiments (peaks 4 and 4',
Figure 6(a)). Equally important is our demonstration
that Watson-Crick A2·T8 pairs are formed
between rather than within hairpin subunits
(Figure 8(c)) in the head-to-head dimeric d(GAGCAGGT)
quadruplex.

The amino protons of A2 in the d(GAGCAGGT)
quadruplex resonate at 7.80 ppm and 9.16 ppm.
The chemical shift of 7.80 ppm is characteristic of
an adenine amino proton hydrogen-bonded to a
carbonyl oxygen atom, such as would occur with a
thymine residue in a Watson-Crick A·T pair. The
chemical shift of 9.16 ppm is consistent with the
other adenine amino proton also being hydrogen-bonded,
and its much larger downfield shift is
reflective of its acceptor being a ring nitrogen, such
as would occur in a slipped A·T·A·T tetrad
(Figure 1(d)).



Single base chain reversal 

Chain reversal involves a single adenine, A5,
within individual hairpin subunits (Figure 8(a)).
The A5 base is bracketed by a closing G6·C4 mismatch
involving pairing between the amino group
of C4 and the acceptor atoms along the major
groove edge of G6 (Figure 2(b)). This concept of a
single base hairpin loop bracketed by a mismatch
pair was first reported for single base loops closed
by sheared G·A mismatches in antiparallel
duplexes. 29,30 There is no pairing between A5 residues
across from each other (Figure 9(a)) in the
head-to-head hairpin dimeric d(GAGCAGGT)
quadruplex. Rather, the A5 residues are stacked
over the adjacent C4 residues (Figure 9(a)).

We have checked for non-standard backbone
torsion and sugar pucker pseudo-rotation (P)
angles within the single residue edgewise turn
spanning the C4-A5-G6 segment amongst the
refined structures of the dimeric d(GAGCAGGT)
quadruplex. For the C4-A5 step, the P value for C4
of 69.8(±5.7)° places it in the C4'-exo range, the γ
value of 149.vv(±40.!)° places it in the trans range
(in contrast to the standard gauche+ value of 64°), and
the χ value for A5 of 17u\l(±4.6)° places it in the
low anti range.

Figure 7. (a) Superpositioned stereo view of ten intensity refined structures of the dimeric d(GAGCAGGT) quadruplex.
The backbones of the individual strands are in orange and green with phosphate oxygen atoms removed
for clarity. The A5 residue is in white, the G6·C4·G6·C4 tetrad in magenta, the G3·G7·G3·G7 tetrad in yellow
and the A2·T8·A2·T8 tetrad in cyan. (b) A GRASP ribbon view of a representative intensity refined structure of the
d(GAGCAGGT) quadruplex. The backbones of individual strands are colored in blue and red. Guanine, adenine and
thymine bases are colored yellow, red and blue, respectively. (c) A GRASP surface view of a representative intensity
refined structure of the d(GAGCAGGT) quadruplex.
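
The range assignments quoted for the C4-A5 step can be illustrated with a
minimal sketch. It assumes the conventional division of the pseudorotation
cycle into ten 36° sectors and a simple three-way split of torsion angles
into 120° bins; the function names are hypothetical and are not taken from
the software used in this study.

```python
# Ten conventional 36-degree sectors of the pseudorotation cycle,
# starting at P = 0 degrees (assumed convention, not from the paper).
PUCKER_SECTORS = [
    "C3'-endo", "C4'-exo", "O4'-endo", "C1'-exo", "C2'-endo",
    "C3'-exo", "C4'-endo", "O4'-exo", "C1'-endo", "C2'-exo",
]


def pucker_from_phase(p_deg: float) -> str:
    """Return the pucker sector containing pseudorotation phase angle P."""
    return PUCKER_SECTORS[int((p_deg % 360.0) // 36.0)]


def staggered_range(torsion_deg: float) -> str:
    """Assign a torsion angle to gauche+, trans or gauche- (120-degree bins)."""
    t = torsion_deg % 360.0
    if t < 120.0:
        return "gauche+"
    if t < 240.0:
        return "trans"
    return "gauche-"


print(pucker_from_phase(69.8))  # P value reported for C4
print(staggered_range(64.0))    # standard gamma value quoted in the text
```

Finer subdivisions such as "low anti" for χ depend on the convention
adopted and are deliberately left out of this sketch.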



Base stacking 

There is excellent pyrimidine/purine and purine/purine
stacking between the bases of the direct
G6·C4·G6·C4 tetrad (in magenta) and the
G3·G7·G3·G7 tetrad (in yellow) in the folded topology
of the d(GAGCAGGT) quadruplex
(Figure 9(b)). We observe partial stacking between
the bases of the G3·G7·G3·G7 tetrad (in yellow)
and the slipped A2·T8·A2·T8 tetrad (in cyan) in
the folded topology of the d(GAGCAGGT) quadruplex
(Figure 9(c)). Overall, there is significant
stacking of bases along the length of the dimeric
d(GAGCAGGT) quadruplex.

The stacking pattern between the G3·G7·G3·G7
tetrad (in yellow) and the slipped A2·T8·A2·T8
tetrad (in cyan) (Figure 9(c)) positions the imino
proton of G7 of one strand in close proximity to
the NH₂ and H2 protons of A2 of the partner
strand, consistent with the observed inter-strand
NOEs (peaks 3 and 3', Figure 6(a); peak 4,
Figure 6(b)) in mixed labeling experiments.



Inter-strand NOEs 

We have observed a set of weaker inter-strand
NOEs, in addition to their stronger counterparts
shown in Figure 6(a) and (b), that merit further
discussion. The studies presented in Figure 6 on an
equimolar sample of unlabeled and uniformly
¹³C,¹⁵N-labeled d(GAGCAGGT) quadruplex in 1 M
NaCl-containing H₂O buffer were recorded at
0°C. We have also collected a ¹⁵N-edited (ω₁),
¹³C,¹⁵N-purged (ω₂) NOESY²⁶ (100 ms mixing
time) spectrum, with the carrier on the imino-¹⁵N,
recorded at 10°C. This spectrum had improved
signal-to-noise, permitting data presentation at a
lower contour level (as shown in Supplementary
Material, Figure S7). We observe a set of weak
inter-strand NOE cross-peaks labeled 5 to 9 in
Supplementary Material, Figure S7, in addition to
stronger peaks 1 to 4, which were also seen in
Figure 6(b).

Peaks 5 and 5', assigned to inter-strand NOEs
between the imino proton of G7 and the amino
protons of C4, are consistent with the stacking
pattern shown in Figure 9(b). Peaks 6 and 6', assigned
to inter-strand NOEs between the imino proton of
G7 and the amino protons of A2, are consistent
with the stacking pattern shown in Figure 9(c).
Peak 9, assigned to an inter-strand NOE between
the imino of G6 and the H2 of A5, is consistent
with the stacking pattern shown in Figure 9(a).

Figure 8. (a) A stick representation of a symmetric
half (corresponding to one of the two strands) of a
representative intensity refined structure of the
d(GAGCAGGT) quadruplex. The color code is the same as
outlined in the legend to Figure 7. (b) Pairing alignment of
the G6·C4·G6·C4 tetrad. (c) Pairing alignment of the
A2·T8·A2·T8 tetrad.




By contrast, peak 7, assigned to an inter-strand
NOE between the imino of G3 and the H8 of A2,
cannot be explained by the overlap pattern in
Figure 9(c). Peak 8, representing an inter-strand
NOE assigned to the imino of G3 and the H8 proton
of G6/G7, cannot be explained by the overlap
pattern shown in Figure 9(b). In both cases, while
the inter-strand distances between proton pairs are
long, their intra-strand counterparts are in close
proximity. Thus, it could be argued that perhaps
peaks 7 and 8 reflect relatively strong intra-strand
NOEs that are not completely removed by isotope
filtering. For this explanation to be valid, peaks 7
and 8 should have been doublets, because ¹³C
decoupling was not applied during acquisition.
Currently, we are somewhat at a loss to account
for this discrepancy involving weak peaks 7 and 8
(Supplementary Material, Figure S7), whose intensity
did vary in the weak range when the experiments
were checked for reproducibility.

Figure 9. Base stacking overlap patterns in a representative
intensity refined structure of the d(GAGCAGGT)
quadruplex. (a) Stacking of A5 (in white) on the
G6·C4·G6·C4 tetrad (in magenta). (b) Stacking of the
G6·C4·G6·C4 tetrad (in magenta) on the G3·G7·G3·G7
tetrad (in yellow). (c) Stacking of the G3·G7·G3·G7 tetrad
(in yellow) on the A2·T8·A2·T8 tetrad (in cyan).

Salt dependence

The broad resonances for the d(GAGCAGGT)
sequence in low NaCl solution (Supplementary
Material, Figure S3(a)) are indicative of aggregate
formation. What was unexpected was the transition
to narrow resonances in 1 M NaCl solution
(Supplementary Material, Figure S3(b)), resulting
in the formation of a dimeric d(GAGCAGGT)
quadruplex, amenable to structural characterisation.
Both the direct G·C·G·C (Figure 1(a)) and
slipped A·T·A·T (Figure 1(d)) tetrads have
inwardly pointing amino groups, an alignment not
conducive to coordinating monovalent cations
positioned either between or within tetrad planes.
Therefore, it is conceivable that 1 M NaCl has
minimal effect on the integrity of the dimeric
d(GAGCAGGT) quadruplex but, for some
unknown reason, may destabilize the aggregated
species that predominates in low salt solution.

Comparison with mixed tetrads involving
minor groove alignment of G·C and A·T pairs

Our structural studies on major groove-aligned
direct G·C·G·C and slipped A·T·A·T tetrads in
the head-to-head dimeric d(GAGCAGGT) quadruplex
(this study) can be compared with reported
structural studies on quadruplexes involving
minor groove-aligned direct G·C·G·C (Supplementary
Material, Figure S1(a)) and slipped
cation-coordinated A·T·A·T (Supplementary
Material, Figure S1(b)) tetrads formed by dimerization
of linear and cyclic octameric sequences.*'*'' 14
Interestingly, the minor groove-aligned direct
G·C·G·C (Supplementary Material, Figure S1(a))
and slipped cation-coordinated A·T·A·T (Supplementary
Material, Figure S1(b)) tetrads also
involve Watson-Crick G·C and A·T pairing
between rather than within head-to-head hairpin
dimers of the quadruplex.

It should be noted that while the major groove-aligned
direct G·C·G·C (Figure 1(a)) and slipped
A·T·A·T (Figure 1(d)) tetrads adopt planar geometries
in the solution structure of the dimeric
d(GAGCAGGT) quadruplex (this study), the minor
groove-aligned direct G·C·G·C (Supplementary
Material, Figure S2(a)) and slipped cation-coordinated
A·T·A·T (Supplementary Material,
Figure S2(b)) tetrads adopt distinctly non-planar
geometries within their quadruplexes, both in the
crystalline* 14 and solution⁹ states.

Biological relevance 

We have reported the first experimental demonstration
of the formation of a major groove-aligned
slipped A·T·A·T tetrad (Figure 1(d)) within a
DNA quadruplex architecture. This result, together
with our previous demonstration of major groove-aligned
direct G·C·G·C tetrad (Figure 1(a))⁷ formation
within a quadruplex architecture, expands
on the code for alignment of duplex segments that
can potentially participate in strand exchange
during homologous recombination. The glycosidic
bonds are directed outwards in major groove-aligned
tetrads (Figure 1), while they are directed
inwards in minor groove-aligned tetrads (Supplementary
Material, Figure S1). There is steric
crowding between the inwardly pointing sugars in
the minor groove-aligned tetrads, resulting in base-sugar
and sugar-sugar interactions which severely
buckle the planes of the tetrads.*- 0 - 1 * By contrast,
there is no such crowding between the outwardly
pointing sugars in the major groove-aligned mixed
tetrads, resulting in a planar arrangement of bases
within the tetrad. Several challenges remain
despite our identification here of a major groove-aligned
slipped A·T·A·T tetrad (Figure 1(d)) at
one end of a quadruplex. Can one identify formation
of a major groove-aligned direct A·T·A·T
tetrad (Figure 1(c)), and can either the direct or
slipped A·T·A·T tetrads be positioned in the
interior of the quadruplex?

McGavin³¹ was the first to recognise the potential
for self-pairing of Watson-Crick G·C and A·T
pairs to form dyad axis-related major groove-aligned
G·C·G·C and A·T·A·T tetrads. Such tetrad-based
quadruplexes could play a role in strand
exchange between two homologous duplexes
during genetic recombination. Wilson³² next proposed
a quadruplex-based model for formation of
reciprocal heteroduplexes from their homologous
duplex counterparts. In this model, homologous
duplexes initially associate through G·C·G·C and
A·T·A·T tetrad formation along their minor
groove edges, placing complementary strands
opposite each other. Strand exchange can then
occur following a 90° rotation of each base, such
that the base-pairs now face each other through
G·C·G·C and A·T·A·T tetrad formation along
their major groove edges. Experimental studies^* 8
have provided evidence for quadruplex formation
by poly(CA)·poly(TG) repeats, while computational
studies 1 * have established that such strand
exchange quadruplexes, as well as quadruplex-duplex
junctions, are stereochemically robust and
form stable entities. Thus, a future challenge
would be to design a DNA quadruplex consisting
solely of G·C·G·C and A·T·A·T tetrads and
attempt to trap structures where the alignment is
through the minor groove on the one hand, and
through the major groove on the other.



Materials and Methods 



Preparation of unlabeled and uniformly ¹³C,¹⁵N-labeled DNA

The unlabeled d(GAGCAGGT) sequence was synthesized
on a 10 μmol scale on an Applied Biosystems
392 DNA synthesizer using solid phase
β-cyanoethylphosphoramidite chemistry and was subsequently
purified by high pressure liquid chromatography (HPLC).

A modified version of the Zimmer & Crothers^
procedure as described"* 2 * was used for the enzymatic
synthesis of uniformly ¹³C,¹⁵N-labeled d(GAGCAGGT)
sequence. The in-house prepared uniformly ¹³C,¹⁵N-labeled
dNTPs 19 * were used as building blocks in the
in vitro polymerization reaction catalyzed by Moloney
murine leukemia virus (MMLV) reverse transcriptase
(Gibco-BRL). The uniformly ¹³C,¹⁵N-labeled d(GAGCAGGT)
8-mer was separated from the unlabeled 24-mer
template using 22% (w/v) denaturing polyacrylamide
electrophoresis. The DNA 8-mer bands were
eluted from the gel by the "crush and soak" procedure and
purified as described above for the non-labeled samples.

NMR data collection and processing

NMR data on the d(GAGCAGGT) 8-mer in H₂O and
²H₂O buffer (1 M NaCl, 2 mM phosphate (pH 6.6)) were
collected on a Varian 600 MHz Unity Inova NMR spectrometer.
Proton assignments are based on homonuclear
NOESY, correlation spectroscopy (COSY), TOCSY and
HCCH-TOCSY experiments. Data sets were processed
and analyzed using the FELIX program (Molecular
Simulations).

Two-bond ²J(NN) scalar couplings between imino and
amino donors and nitrogen acceptors in uniformly
¹³C,¹⁵N-labeled d(GAGCAGGT) in 1 M NaCl were monitored
in HNN-COSY"' 25 and H(CN)N(H) M ^ contour
plots using pulse sequences described in the literature.

Distance restraints

The distances between non-exchangeable protons
were estimated from the buildup curves of cross-peak
intensities in NOESY spectra at four different mixing
times (50, 150, 200 and 250 ms) in ²H₂O and given
bounds of ±30%, with distances referenced relative to
the cytosine H6-H5 distance of 2.47 Å. Exchangeable
proton restraints are based on NOESY data sets at two
mixing times (60 and 200 ms) in H₂O. Cross-peaks involving
exchangeable protons were classified as strong
(medium to strong intensity at 60 ms), medium (weak
intensity at 60 ms) and weak (observed only at a mixing
time of 200 ms), and proton pairs were then restrained,
respectively, to distances of 3.0(±0.9) Å, 4.0(±1.2) Å and
6.0(±1.8) Å. Since the experimental NMR data are consistent
with a dimeric quadruplex motif, non-crystallographic
symmetry restraints were imposed on all heavy
atoms.
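
The restraint bookkeeping described above can be sketched as follows. This
is a minimal illustration, not the procedure actually used: it assumes the
isolated spin-pair approximation (intensity proportional to r⁻⁶) for
converting a cross-peak intensity to a distance, whereas the text states
only that buildup curves were referenced to the cytosine H6-H5 distance.
All function names are hypothetical.

```python
REF_DISTANCE = 2.47  # Å, cytosine H6-H5 reference distance from the text

# Target distances (Å) for exchangeable-proton cross-peak classes
EXCHANGEABLE_TARGETS = {"strong": 3.0, "medium": 4.0, "weak": 6.0}


def distance_from_intensity(intensity, ref_intensity,
                            ref_distance=REF_DISTANCE):
    """Estimate a distance from relative NOE intensity.

    Assumes intensity is proportional to r**-6 (isolated spin pair).
    """
    return ref_distance * (ref_intensity / intensity) ** (1.0 / 6.0)


def with_bounds(distance, fraction=0.30):
    """Return (lower, upper) bounds at +/- fraction of the distance."""
    return (distance * (1.0 - fraction), distance * (1.0 + fraction))


def classify_exchangeable(intensity_at_60ms, seen_at_200ms):
    """Map qualitative intensities to the restraint classes in the text.

    intensity_at_60ms: 'strong', 'medium', 'weak' or None (not observed).
    """
    if intensity_at_60ms in ("strong", "medium"):
        return "strong"
    if intensity_at_60ms == "weak":
        return "medium"
    if seen_at_200ms:
        return "weak"
    return None


label = classify_exchangeable(None, True)
print(label, with_bounds(EXCHANGEABLE_TARGETS[label]))
```

Note that the ±0.9, ±1.2 and ±1.8 Å tolerances quoted for the exchangeable
classes correspond to the same ±30% bounds applied to the non-exchangeable
distances.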

Structure calculations

The structure of the d(GAGCAGGT) in 1 M NaCl was
determined by molecular dynamics (MD)-simulated
annealing computations driven by NOE distance and
hydrogen bonding restraints using the X-PLOR package,
version 3.8? 7 At the initial stage of the refinement,
torsional molecular dynamics was undertaken at high
temperature. The molecules were equilibrated at 20,000 K
(30,000 steps over 3 ps) and then cooled very slowly to
1000 K (40,000 steps over 20 ps). The potential energy
function included a repulsive force field, NOE and
hydrogen bond distance restraints, glycosidic bond (χ)
dihedral angle restraints and a non-crystallographic
symmetry potential. The force constant for NOE distance
restraints was maintained at a value of 30 kcal mol⁻¹
Å⁻², while for hydrogen bond restraints the value was
50 kcal mol⁻¹ Å⁻². All NOE distance restraints were
considered as ambiguous and treated with the "sum"
averaging option.^* Dihedral angle restraints (210(±40)°,
with force constant of 50 kcal mol⁻¹ rad⁻²) were
imposed on glycosidic torsion angles for the residues A2,
C4, A5, G6, G7 and T8 shown experimentally to adopt
anti conformations. Dihedral angle restraints (65(±40)°,
with force constant of 50 kcal mol⁻¹ rad⁻²) were
imposed on glycosidic torsion angles for the G3 residue
shown experimentally to adopt the syn conformation. The
force constant for non-crystallographic symmetry was
maintained at 30 kcal mol⁻¹ Å⁻².
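
As an illustration of how an ambiguous restraint enters the target
function, the sketch below combines r⁻⁶ "sum" averaging over all
candidate proton pairs (for example, the intra- and inter-strand pairs in a
symmetric dimer) with a flat-bottomed harmonic penalty. The square-well
form of the penalty is an assumption made here for illustration; the text
specifies only the averaging option and the force constants. The function
names are hypothetical.

```python
def sum_averaged_distance(distances):
    """Effective distance for an ambiguous restraint: (sum r**-6)**(-1/6)."""
    return sum(r ** -6 for r in distances) ** (-1.0 / 6.0)


def square_well_energy(r, target, minus, plus, k=30.0):
    """Flat-bottomed harmonic penalty (assumed form), k in kcal/(mol Å^2).

    Zero inside [target - minus, target + plus], quadratic outside.
    """
    if r > target + plus:
        deviation = r - (target + plus)
    elif r < target - minus:
        deviation = (target - minus) - r
    else:
        deviation = 0.0
    return k * deviation * deviation


# One short and one long candidate distance: the short one dominates.
r_eff = sum_averaged_distance([3.2, 7.5])
print(r_eff, square_well_energy(r_eff, 3.0, 0.9, 0.9))
```

Because the shortest contributing distance dominates the r⁻⁶ sum, an
ambiguous restraint is satisfied as soon as any one of the candidate pairs
is close enough.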

These computations were followed by lower temperature
Cartesian space molecular dynamics guided by the
hydrogen bonding and NOE distance restraints with
changes in the potential energy function: the repulsive
force field was replaced with Lennard-Jones potentials
and planarity restraints were included for tetrad planes
with low weights of 5 kcal mol⁻¹ Å⁻² and 10 kcal mol⁻¹
Å⁻², respectively. During this stage of the dynamics, the
structures were further cooled from 1000 K to 300 K
(20,000 steps over 10 ps) and minimized until the gradient
of energy was less than 0.1 kcal mol⁻¹. It should be
noted that computations repeated without planarity
restraints resulted in the same low energy structures, but
exhibited a lower convergence rate.
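
The step counts and durations quoted for the three dynamics stages imply
the integration time steps and mean cooling rates computed in the short
sketch below. It simply restates the numbers given in the text; the shape
of the cooling ramp is not specified, so only average rates are derived,
and the stage labels are chosen here for illustration.

```python
# (label, steps, duration in ps, start temperature in K, end temperature in K)
STAGES = [
    ("torsion-space equilibration", 30_000, 3.0, 20_000.0, 20_000.0),
    ("torsion-space cooling", 40_000, 20.0, 20_000.0, 1_000.0),
    ("Cartesian-space cooling", 20_000, 10.0, 1_000.0, 300.0),
]


def time_step_fs(steps, duration_ps):
    """Integration time step in femtoseconds."""
    return duration_ps * 1000.0 / steps


def mean_cooling_rate(duration_ps, t_start, t_end):
    """Average temperature decrease in K per ps over a stage."""
    return (t_start - t_end) / duration_ps


for label, steps, ps, t0, t1 in STAGES:
    print(label, time_step_fs(steps, ps), mean_cooling_rate(ps, t0, t1))
```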

The refinement protocol started from 60 different
initial structures. The initial structures were generated as
sets of two chains, each eight nucleotides long, randomized
for all dihedral angles, and separated by space intervals
of 50 Å. The convergence rate following dynamics
was good for the case where the computations were
guided by hydrogen bonding restraints associated with
the topology shown in Figure 2(a): sixteen structures out
of 60 emerged with the same fold and pairwise r.m.s.d.
values less than 1.0 Å between members of the group.
Non-converged structures were separated from that
group by large gaps (in total more than 400 kcal) in all
components of the potential energy (van der Waals,
NOE violations, covalent geometry).

The ten converged distance refined structures corresponding
to the folding topology shown schematically in
Figure 2(a) were used as the starting point for subsequent
X-PLOR based energy minimization with back-calculation
of the NOESY spectra. The relaxation matrix
was set up for the non-exchangeable protons, with the
exchangeable imino and amino protons exchanged for
deuterons. A total of 712 non-exchangeable intensity
values from NOESY data sets at four mixing times in
²H₂O buffer (178 non-exchangeable intensities per mixing
time) were included with force constant of 500 kcal
mol⁻¹. The planarity restraints were set to very low
values of 1.0 kcal Å⁻² for the G·G·G·G tetrad and
0.5 kcal Å⁻² for the A·T·A·T and G·C·G·C tetrads. The
distance restraints were retained with 30% bounds and
the same weights as before. During minimization, the
NMR R factor (R1/6) improved from the initial value of
12% to 7.4% while retaining structure convergence and
stereochemistry.
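
A sixth-root NMR R factor of the kind commonly used in relaxation matrix
refinement can be sketched as below. The exact expression and scaling
applied in this study are not stated, so the formula (sum of absolute
differences between sixth roots of observed and back-calculated
intensities, normalised by the observed sixth roots) and the `scale`
parameter are assumptions made for illustration.

```python
def r_factor_sixth_root(observed, calculated, scale=1.0):
    """Sixth-root R factor between observed and back-calculated intensities.

    R = sum |Io**(1/6) - scale * Ic**(1/6)| / sum Io**(1/6)   (assumed form)
    """
    if len(observed) != len(calculated):
        raise ValueError("intensity lists must have the same length")
    n = 1.0 / 6.0
    numerator = sum(abs(o ** n - scale * c ** n)
                    for o, c in zip(observed, calculated))
    denominator = sum(o ** n for o in observed)
    return numerator / denominator


# Hypothetical intensities in arbitrary units, for demonstration only
obs = [1.00, 0.40, 0.15, 0.05]
calc = [0.90, 0.45, 0.12, 0.06]
print(round(100.0 * r_factor_sixth_root(obs, calc), 1), "%")
```

Taking sixth roots places the comparison on an approximate distance scale,
so that weak cross-peaks between distant protons are not swamped by the
strongest intensities.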



Coordinates deposition 

Coordinates (accession number 1JVC) of the dimeric
d(GAGCAGGT) quadruplex have been deposited in the
RCSB Protein Data Bank.